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Matheaatlcal  Models  of  tho  Event  Related  Potential 

Earl  Hunt 

The  University  of  Washington 

The  event  related  potential  (ERP)  la  an  electrical  signal 
rovlded  by  the  brain  in  reaponae  to  external  atlaull.  ERPa  in 
uaana  are  recorded  f roe  aurface  electrodes.  The  resulting 
aplltude  s  tlae  wsvefora  ia  believed  to  contain  aoae 
nforaatlon  about  the  braln*a  response  to  the  stlaulus  (the 
signal*)  but  that  signal  la  eaersed  In  extraneous  lnforaatlon 
■solas*)  about  brain  events  not  associated  with  cognitive  or 
vert  rosponaos.  A  variety  of  aathoaatlcal  techniques  have 
sen  proponed  for  Isolating  signal  eoaponenta  of  the  wavefora. 
a  an  uauaually  creative  paper,  Doaehln  and  Heffely  (1979) 
olntod  out  that  there  la  a  nathonatleal  lsoaorphlsa  between 
eeordlags  fron  electrodes  and  tho  data  obtained  froa  classic 
oraoaallty  and  intelligence  teats,  and  that  this  lsoaorphlsa 
ould  bo  used  to  apply  a  widely  used  psyohoaetrlc  aetbod  of 
ata  analysis,  Prlnolpal  Coaponenta  Analysis  (PCA),  to  the 
aalysls  of  ERP  records.  Tho  Donohln  and  Heffely  proposal  has 


This  paper  is  a  critique  of  the  logic  behind  PCA. The 
aatheaatloal  assnaptlons  underlying  the  aethod  will  first  be 
reviewed,  stressing  tbs  plausability  of  the  assuaptlons  as 


IMF  Aaalyala 
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stataaanta  about  payehophyalology .  Tbo  eritlqua  will  •■phaal 
poaalbla  dlatortlona  of  raaulta  that  oould  bo  lntroducad  in 
altuatioaa  in  which  aoaa  of  the  nathoaatioal  aasuaptlona  ara 
not  dafaaalbla  oa  paychophyalologlcal  grounda. 


Tha  Nathaaatlea  of  tha  PCA 

Tha  raoardlng  froa  a  aingla  alaotreda  la  aoat  naturally 
aaad  aa  a  variation  in  aaplltuda  ovor  tiaa, 


(1)  x[ 1 > t ]  a  f(t), 

whara  i[l,t]  la  tha  pataatial  of  tha  1th  aloatroda  at  tiaa  t  (t 
a  1...T),  and  f  la  a  oaatlanoaa  fwaatlon  of  tiaa.  Thla  will  ba 
eallad  tha  "functional”  rapraaaatatlon  of  tha  KIP.  By 
convention.  tha  atlaalao  la  proaantad  at  tiaa  t*0«  Xn 
practice,  s(t]  la  aaaplod  at  dlaarata  tiaa  Interval*,  ao  tha 


(2)  x(l)  • 


tll»0j*— 


xll.t] 
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In  T  dlaonslonal  spses.  This  will  bn  called  tbs  'Euclidean* 
representation  of  the  EBP. 

Because  there  Is  a  one:one  correspondence  between  equation 
(1),  at  discrete  tine  points,  and  equation  (2),  any  data 
analysis  technique  that  unoovers  regularities  In  the  Euclidean 
representation  should  be  interpretable  in  terns  of  a  regularity 
In  the  aore  natural  functional  representation.  This  fact  Is 
the  logical  basis  for  Donchln  and  Heffely's  proposal.  Principal 
Coaponents  Analysis  is  a  teohnlque  for  unooverlng  regularities 
in  a  luelidean  representation.  Note,  though,  that  the  proposal 
is  baaed  on  e  "backwards*  arguaent,  soaethlng  that  la  true  in 
the  Euolldeaa  representation  aunt  be  true  la  the  functional 
representation.  But  what  about  arguaent  In  the  other 
dlreetioa?  The  proposal  is  valid  to  the  extent  that  the 
tranalatioa  froa  the  funotioaal  to  the  tuelidenn  representation 
retaiaa  what  is  laportant  in  the  BBP  record.  To  evaluate  this 
issue  one  aust  oonsider  how  signal  and  noise  ooaponents  coablne 
to  produce  the  original  fuaotioaal  representation. 

The  BBP  is  assuaed  to  he  produoed  by  the  suaaatlon  of  B 
ooaponeat  wave  foras.  These  are  sawed  by  their  polarity  and 
the  approximate  tiaea  of  their  peak  aaplitudea,  as  ia  N100, 
P300,  eta.  The  first  aoa-xoro  point  of  a  eoapoaent  wave  fora 
will  he  called  its  latenoy.  This  is  slightly  different  froa 


1 


la 
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I 

tha  oonvantlon  In  payehophyalology ,  which  la  to  daflna  latency 
by  tbo  tlna  of  tba  poak  aaplltuda.  Tha  dlffaranca  doaa  not 
•ffaot  tha  logic  of  tha  arguaent  to  ba  praaantad. 

Tha  natural  way  to  thlak  of  a  ooaponant  viva  fora  la  aa  a 
potaatlal  that  aaauaaa  a  non-taro  aaplltuda  at  tlaa  L[K),  and 
follova  a  flxad  tlaa  eouraa,  tilth  a  aaxlaua  abaoluta  axcuralon 
(aaplltuda)  A[l,k].  Tha  tlaa  aouraa  of  tba  uava  la  flxad  aeroaa 
all  raearda,  but  tha  aaplltuda  aay  vary  froa  raoord  to  raoord. 

Thua  aaoh  uava  fora  oould  ba  thought  of  aa  having  a  atandard 
fora,  that  baglna  at  taO,  a  aaxlaua  abaoluta  aaplltuda  of  t.and 
taking  tha  valua  glk](t)  at  tlaa  t.  A  atandard  fora  can  ba 
eonvartad  Into  aa  aatual  aaapaaant  fora  by  aultlplying  tha 

a 

ataadard  fora  by  tha  aaplltuda  A[l,k)  that  eharaotarlxaa  tha 
raoord; 

(3)  •talua  of  kth  fora  la  raoord  l  at  tlaa  f  .  A[l,k) 
glhl(t). 

Tha  ooaponant  foraa  ana  to  produoa  a  "trua*  raoord, 

y(l,t],  whara 

* 
a 

-  K 

<«)  yCl.tJ  ■  ?  A[i,k)  g(k]  (t). 

Kxl 

Tha  obaarvad  raoord  la  darlvad  froa  tha  trua  raoord  by  tha 
nddlt|on  of  an  arror  ooaponant  a[l](t),  that  haa  an  axpaotad 
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value  of  aoro  and  aoae  unknown  varlano*. 

Tha  nodal  for  the  obaarvod  record  bacoaoa 

(5)  s[l.t]  »  ^  A[l,k]  ♦  o[ll(t). 

K*l 

It  la  doalrabla  to  aaauaa  that  tha  *(l](t)'a  ar* 
ladopoadoat  over  1  (raoorda)  and  t  (tie*  polnta).  Thla  la 
oftoa  uaroallatle.  For  lnataaee.  aoat  avanta  that  would 
preduee  a  perturbation  la  tha  alaetrloal  record  would  extend 
ovor  aovoral  tlaa  polnta.  To  avoid  tbla  problea,  the  typical 
aolutloa  la  to  uaa  aa  a  •raoord*  tha  average  wavefora  recorded 
over  aoaa  fairly  largo  Busbar f  a  (a  100).  of  trlala  obaerved 
uador  thaoretloally  ldaatloal  ooadltloaa.  Tbo  ratloaale  for 
thla  la  that  tha  true  value  ytl.t],  ahould  raaala  eeoataat,  aad 
o(l](t)  ahould  aovo  to  Ita  expectation,  aero.  Writing  X  aad  T 
for  tha  auaaod  wave  foraa, 

(«)  KXll.t]}  e  TCl.tJ  ♦  B(o(l](t>) 

•  m.ti 

thua  roaovlag  tha  e[l](t)'e  fra a  eoaoera. 


lafaraatlaa  about  tha  uadarlylag  eaapanaata  fraa  aa  aaalyala  of 


o' 


» 


Two 


1*. ... 
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types  of  information  are  of  interest;  tbe  standard  forns 
(g[k](t))  for  each  ooaponent  waves,  and  the  amplitude  A[i,lc]  of 
eaoh  fora  ooaponent  in  eaob  record.  These  values  are  the 
results  of  PCA.  In  order  to  define  a  aatheaatioally  tractable 
problea,  however,  PCA  deaaada  sons  further  asauaptiona  about 
the  date.  They  will  now  be  deseribed. 

What  Priaoipal  Coaponenta  Analysis  Poos 

Xa  order  to  understead  how  PCA  works  a  geometric 
preaeatation  is  useful.  Suppose  that  tbe  aodol  of  tho  KRP  that 
baa  been  presented  were  exactly  true.  It  would  then  be 
'possible  to  plot  any  KIP  rooord  as  a  point  dofined  by  its  value 
oa  eaob  of  tbe  T  tiae  points.  This  is  shown  for  a  swara  of  SRP 
reoorda  ia  figure  1.  Oae  oan  iaaglno  a  'best  fitting 
ellipsoid'  that  would  fit  tble  swara.  as  shown  la  the  figure. 
PCA  is  a  aatbeaatloal  technique  for  deteralnlag  tbe  best 
fitting  ellopsold  and  finding  its  aaee.  Tbe  data  points  will 
be  plotted  in  N-dlaenaloaal  epaoe,  ao  tbe  ellipsoid  will  have  I 
axes.  However,  tbe  original  dlaenalons  (l.e.  the  tiae  points) 
will  typically  have  highly  oorrelated  values.  Therefore,  the 
points  will  tend  to  lie  la  a  K  dlaenslonal  ellipsoid 
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Figure  1  here 


•■bedded  in  the  N-dlaenelonal  space.  A  whimsical  exaaple  aay 
help  to  Illustrate  this.  A  pizza  la  an  'alaoat*  two 
diaensional  ellipsoid  lying  In  a  three  dlaenslonal  apace. 
Coaputer  prograaa  for  FCi  extract  the  axea  In  descending  order 
of  their  lengths,  until  It  is  felt  that  further  extractions  of 
axes  would  be  atetlstl eally  unrelleble.  Several  criteria  have 
been  offered  for  asking  this  determination.  The  conventional 
criterion  la  'the  eigenvalue  of  the  coaponent  la  less  than 
one.'  Thle  la  an  arbitrary  aatheaatical  determination, 
soaewhat  akin  In  aeanlng  to  'per  cent  of  variance  accounted 
/or.'  More  Justifiable,  but  aore  complicated,  criteria  have 
also  boea  offered.  No  atteapt  will  be  aade  to  defend  any  of 
thea.  In  aoet  practical  eases  they  seea  to  usually  lead  to  the 
same  decision. 

Once  the  ellipsoid  has  been  found,  it  Is  possible  to  plot 
the  projection  of  any  point  on  each  of  Its  axes.  This  la  shown 
in  Figure  2,  whlob  plots  a  single  point,  1,  with  reference  to 
two  time  axes  (tel,  2)  and  two  ellipse  axes  (k«1,  2).  Next, 
oonalder  the  projection  of  point  1  on  soae  tine  axis,  t.  Froa 
•leaeptary  analytic  geoaetry,  thla  will  be.  In  tbe  exaaple 
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(7)  >[l,t]  >  BU,1J  cos  (  £  1 . 1 1  >  ♦  B  [  1 ,2]coa(  [  2 ,  t  ] ) 

ahara  B[l,k]  la  th*  projaotlon  of  data  point  1  on  tha  kth  axla 
of  tha  alllpaa  and  Q  [k,t]  la  tha  angl*  batwaan  tha  kth  axla  of 
tha  alllpaold  and  th*  1th  tin*  dlaanalon.  B[lpk]  la  callad  tha 
factor  near*  of  racord  1  on  coaponant  k.  and  coa^tk.t]  la  th* 


loadlaa 


int  k. 


(7).  for  k  dlaanalona  and  T  tla* 


point a,  la 


(8)  X[l,t]  >  £  B[l,k]eoa0[k,l] 


finally. 


K  T,  l.a.  that  tha  axtraotlon  of 


eaapoaanta  haa  baan  haltad  at  aoaa  point  laaa  than  th*  full 


(To  llluatrata.  auppoa*  that  only 


2.)  Than  It  would  b* 


naoaaaary  to  lntroduoa  a  oorraotlon  tarn,  o.  for  aach  data 
point  at  aaob  tlaa,  to  aeoouat  for  a  data  point'*  aovaaent  In 
'hlghar  dlaanalon'  than  thoa*  rapraaantad  In  th*  alllpaold. 

Lot  C[l](t)  ha  tha  oorraotlon  to  th*  1th  point  on  th*  1th  tla* 
dlaanalon.  Than  aquation  (8)  axpanda  to 


I[l,t]  a  (  B[ 1 , k ]  00*  [k.t]  a  C[l](t). 
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Equation  (9)  naps  directly  onto  equation  (5),  the  mathematical 
modal  of  the  ERP,  by  substituting  A[i,k],  g[k](t),  and  e[l](t) 
for  the  B,  coa@  ,  and  C  terns.  Therefore  It  Mould  seen  that 
PCA  solves  the  problem  of  snslyzlng  the  ERP  data  to  find  the 
valves  required  by  the  model. 

In  a  sensei  It  does.  The  standard  fora  of  a  component  la 
determined  by  the  loading  of  the  scores  Involving  a  factor 
(l.e.  by  the  vector  (oos  [kit]),  varying  over  t),  and  the 
amplitude  of  a  oonponent  In  a  particular  record  Is  given  by  the 
fmotor  score  for  that  record. 

Howeveri  the  solution  depends  critically  upon  two 
assumptions.  These  are  that  the  distribution  of  the  data, 
plotted  In  the  T  dimensional  tine  space  is  accurately 
characterised  as  an  ellipsoid,  and  that  the  axes  of  the 
ellipsoid  correspond  to  the  components  used  to  generate  the 
data.  Both  these  conditions  will  be  met  if  the  following 
statements  are  true: 

S.l  The  dimensions  defining  each  component  are  orthogonal. 
That  la,  there  Is  oo  correlation  between  (g[k](t))  and 
(l(k](t'))i  calculated  over  t. 


3.2  The  ERPs  are  defined  as  In  equation  (5).  This  means 
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that  ths  standard  forns  (g[k](t))  are  Invariant  over  all 
records. 


S.3  The  aaplltudes  {  A[l,k]  )  of  the  ktb  ooaponent  In  each 
of  the  I  Sifs  sre  established  by  taking  i  Independent  sanples 
fron  a  noraal  distribution  with  ezpectstlon  E(A[k])  snd 
variance  V(l[k]). 

3.  A  The  error  terna  e[l](t)  are  deteralned  by 
Independently  aaapllng  froa  a  population  with  expectation 
E(e(t))  ■  0  and  varlanoe  T(e(t)). 


Figure  2  here 


gaoh  of  the  assuaptloae  can  be  questioned.  The  effeet  of 
relaxing  then  will  be  considered  In  the  next  section.  Before 
prooeedlng  to  the  critique,  though,  one  aore  issue  concerning 
the  noraal  use  of  PCI  needs  to  be  considered.  The  wave  foraa 
reoovered  directly  froa  PCA  tend  to  be  shallow,  and  any  contain 
both  positive  and  negative  ooaponents,  even  when  rather  sharply 
defined,  single  peaked  ooaponents  appear  in  the  graphic  record. 
Just  why  this  should  happen  Is  not  clear.  One  possibility  is 
that  the  data  points  are  being  forced  Into  an  elliptical  fora 
by  the  analytic  procedure,  even  though  the  best  fitting  ellipse 
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la  a  poor  description  of  tbs  data.  (An  example  of  hot*  this 
■lght  occur  la  dlaouaaad  below).  A  second  possibility  Is  that 
tbs  data  doss  have  a  generally  elliptical  shape,  but  that  the 
estimation  procedure  used  by  PCA  does  not  well  recover  that 

shape. 


A  procedure  known  as  *varlaazlng*  Is  used  to  produce  sore 
sharply  defined  wave  foras.  What  varlnaxlng  does  Is  to  rotate 
the  discovered  axes  so  that  they  are  approxlaately  parallel  to 
a  few  tlae  axes  and,  aore  laportantly,  so  that  they  have  sero 
loadings  on  (l.e.  are  orthogonal  to)  aany  tlae  axes.  Such  a 
procedure  avoids  shallow  wavoforas.  To  see  why,  consider  the 
aatheaatlcal  meaning  of  the  "shallow*  waves,  recovered  by  PCA. 
Za  a  shallow  wave  the  eoeffeolents  of  the  standard  fora,  the 
values  In  the  set  (  g[k)(t)  ),  are  approxlaately  equal  In  size. 
As  a  result,  the  varlaneo  of  those  numbers,  V(g[k]),  will  be 
aaall.  This  ooatrasts  with  the  numbers  (  g[k](t)  )  derived 
froa  a  sharply  peaked  function,  which  will  have  a  higher 
variance.  The  varlaax  procedure  rotates  the  component  axes  to 
aaxlalae  the  sun  of  tho  variances  aorosa  all  ooaponents,  l.e. 

It  aaxlalses 

K 

(10)  f  e^  T(gtk]). 
ke  1 

The  rotation  procedure,  however,  retains  the  requirement  that 


I 
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the  eoaponent  axes  be  orthogonal.  Subject  to  thla  constraint, 
varlaaxlng  will  always  produce  a  sharp  function.  It  does  so  by 
using  a  aatheaatlcal  criterion  for  evaluating  a  function 
(wavefora).  The  aatheaatloal  criterion  used  does  not  have  a 
cleareut  physiological  Interpretation. 

Critique 

Principal  coaponent  analyala  solves  the  aatheaatlcal 
problea  for  which  It  la  defined.  Any  quarrel  with  PCA  as  a 
aethod  of  analysis  aust  focus  on  the  problea  stateaent;  are 
equation  (5)  and  aaauaptlons  3.1-S.4  reasonable  approxlaatlons 
of  the  way  In  which  the  psychophysiologlcal  data  waa  gathered? 

Scan  argaaants  against  the  PCA  aaauaptlons  will  now  ha 
offered.  Por  ease  of  reading,  each  assuaptlon  will  be  restated 
la  abbreviated  fora  prior  to  coaaeatlng  upon  it. 

3.1.  Ortbaannalltv  The  coaponent  dlaenalons  are  orthogonal. 

Thla  assuaptlon  is  highly  suspeet.  Matheaatloally ,  a 
aeoesaary  and  sufficient  eoadltloa  for  orthogonality  of  the  ktb 
end  4th  eeaponent  la  that  there  he  no  oorrelatloa  between  the 
aaplltudes  of  the  ktb  and  Jth  ooaponent,  when  the  correlation 
la  ooaputed  aoross  Individuals.  The  ooaputatlonal  aethods  used 
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by  PCA  ensure  that  tha  discovered  coaponants  satisfy  the 
orthogonality  raqulraaant.  But  eonaldar  tha  physiological 
Intarpratatlon  of  eoaponeat  anplltudaa.  Tha  tarns  A[l,k)  and 
1(1, JJ  raprasant  tha  anplltudaa  of  tvo  different  coaponants, 
recorded  froa  tha  sane  electrode,  recording  fron  the  sane 
brain.  Any  general  process  that  affaota  tha  excursion  of 
eleetrloal  activities  (not  the  average  potential)  ought  to 
affect  both  oonpoaente.  laterally  this  would  produce  a 
correlation  across  records. 

If  the  KIP  la  produced  by  correlated  oonponents,  the  use 
of  PCA  ought  to  produce  fewer  coaponants  than  ar*  actually 
present.  The  dlaoovered  coaponants  should  ba  nlxtures  of  the 
true  oonponents.  Interestingly ,  varlaaxlng  nay  halp  In  such 
situations.  If  the  first  conponent  discovered  In  a  PCA  analysis 
represents  a  eoaproalae  between  several  correlated  coaponants, 
varlaaxlng  ought  to  nova  tha  largest  (first  extracted)  PCA 
ooaponent  towards  the  largaat  of  the  underlying  true 
eoapoaenta.  lowever  the  relation  between  the  true  coaponants 
and  the  subsequently  extracted  PCA  coaponeats  la  unclear. 

3.2  The  aaae  standard  wave  fora  applies  to  all  KIP  record*. 


Interesting  oases.  Suppose  one's  saaple  is  node  up  ef  peopls 
who  vary  widely  la  age.  There  is  a  substantial  body  of 


literature  Indicating  that  thara  ara  latency  differences  In  tha 
HP's  recorded  froa  young  and  elderly  bralna  (Pord  and 
Pfefferbaua,  1980),  and  possibly  other  dlffereneea  In  ahapa  aa 
Mall.  All  of  thaaa  would  be  reflected  aa  ebangaa  in  atandard 
foraa.  A  alallar  oaaa  could  be  aada  If  Individual  racorda  ara 
taken  froa  different  pointa  on  the  akull.  If  the  various  ERP 
eoaponenta  eaanate  froa  different  pointa  In  the  brain,  one 
would  aspect  the  latenelea  of  their  appearancea  to  be 
Influenced  by  the  placeaent  of  the  recording  electrode. 

Fallurea  of  aaauaptlon  S.2  any  be  partly  raaponalble  for 
the  produotlon  of  aflet*  ooaponent  foraa  by  the  uae  of  PCA 
without  rotation.  Suppoae  that  the  IPP  record  la  actually 
produced  by  a  alngle  sharply  peaked  coaponent  wave  that  varies 
la  lateaoy  over  individuals.  This  la  shown  in  Figure  3,  panel 
a.  FCA  will  average  the  fora  over  records,  producing  a  flat 
wave,  alallar  to  that  abowa  la  Flguro  3,  panel  b.  farlaazlng 
aay  partially  correct  for  tho  situation,  by  aovlng  the 
discovered  wave  fora  toward  the  fora  that  exists  In  aost  of  the 
records.  While  this  la  usually  deaireable,  the  nature  of  the 
averaging  process  la  not  clear.  First,  the  averaging  Interacts 
with  tho  orthogonality  rogulreaont,  so  that  after  the  flrat  FCA 
ooaponent  has  been  defined  It  la  not  dear  what  la  being 
averaged.  Seoond,  and  perhaps  aore  laportaat,  the  resulting 
aaplltudea  do  not  correctly  reflect  (and  In  general  will 
undereetlnate)  the  aaplltude  of  the  coaponent  wave  fora  In 
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abarrant  Individuals.  Finally,  and  perhaps  aoat  important,  the 
aathod  bldaa  Individual  dlffarenoea  In  coaponent  uave  forna. 
These  aay  ba  of  lntaraat  In  thaasalvaa. 


Figure  3  her* 


S.3  The  aaplltudas  {  A[l,k]  }  are  deteralned  by  Independent 
eaapllng  from  a  noraal  distribution  with  aean  u(k)  and  variance 
*(k). 


This  la  a  particularly  laportant  asauaptlon  to  consider, 
for  It  appears  that  FCA  la  often  practiced  In  situations  where 
S.3  la  likely  to  ba  falsa. 

Suppose  that  FCA  is  conducted  on  a  data  set  consisting  of 
N [  1  ]  records  drawn  froa  population  1  and  H[2]  records  fron 
population  2.  For  exaaplo,  sob*  of  the  records  sight  be  drawn 
froa  hoalthy  participants,  and  other  records  drawn  froa 
alcohollos.  This  will  bo  oallod  tbs  'population*  design  for 
brovity.  Suppose  further  that  asauaptlon  S.3  Is  true  within 
oaoh  population.  Zn  tbla  oas*  th*  data  points  fron  each 
population  should  traes  out  tholr  own  ellipsoid  foras  In  th* 
Euclidean  tins  representation.  This  is  shown  by  th*  solid 
linos. In  Figure  A. 


i 


vawe  •» 
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figure  A  here 


Under  such  conditions  atntonont  S.3  oannot  bo  true  of  tbo 


:*v 


data  aat  aa  a  whole,  unloaa  tbo  two  populations  ara  drawn  free 
Identical  distributions.  If  PCA  la  applied  to  the  entire  data 
set  tbe  computer  progran  will  define  a  •beat  fitting  ellipse* 
for  the  entire  aawple  that  nay  not  be  characteristic  of  either 
population.  An  exaaple  la  shown  by  the  dotted  line  in  Figure 
A.  In  fact,  the  *eoaponoat*  defined  by  PCA  In  a  population 
design  la  likely  to  be  that  linear  combination  of  tine  polnta 
that  beat  dlacrlwlnatea  between  the  two  groups.  This  la  a 
aajor  problea  If.  as  la  aoaetlaas  dona,  aeaaurea  derived  froa 
PCA. .ueually  the  aaplltwdea. .are  aaed  to  diaerlalaate  baweea 
the  growpa.  (Per  oxaapla,  oaa  Bight  porfora  an  analyals  of 
varlaaoo  on  the  aaplltwdea  of  a  eoapoaeat  that  waa  defined  by 
an  analysis  or  all  the  data.)  In  aueh  a  design  group  Identity 
la  the  Independent  variable  and  the  aeasura  derived  frea  the 
PCA  la  the  dependent  variable.  Neat  atatlstleal  tests  de  net 


apply,  bee 


variable. 


Oeoaaloaaly  a  study  la  reported  In  which  the  data  set 
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oonsiata  of  R[1]  electrode  recordings  fro*  aach  of  M [ 2 ) 
subjects,  for  a  total  of  R[1j  x  S[2]  racorda.  This  will  b* 
oallad  tha  "Individuals*  daalgn.  On*  aotlvatlon  for  th* 
Individuals  daalgn  la  that  R  Bust  b*  graatar  than  T  In  ordar  to 
produea  any  PCI  solution,  and  R  should  b*  approximately  5  x  T 
to  produo*  a  aathaaatleally  atabl*  solution.  In  an  Individuals 
daalgn  tha  ERP  racorda  froa  aaeh  parson  can  ba  regarded  aa 
defining  a  ooherent,  related  aubsat  of  th*  data  set.  Exactly 
th*  aaaa  considerations  apply  to  tha  subsets  as  applied  to  th* 
groups  In  tha  population  design.  The  data  points  froa  ech 
Individual  aay  fora  their  own  alllpaa,  but  tha  PCI  progran  will 
disregard  this.  Instead,  aa  In  the  caa*  of  tha  populations 
problaa,  PCS  will  aaphnslxe  coaponants  that  define  differences 
batwaan  individuals,  at  tha  expanse  of  ignoring  coaponants  that 
are  doalnant  in  every  parson. 

Stateaont  S3  any  "fall"  aoro  subtly.  Suppose  that  tha 
aaplltudos  of  different  coaponants  co-vary  across  individuals. 
Suoh  a  situation  would  arias  if  thsrs  ware  any  Individual 
charaotaristioa  that  affsotsd  th*  aaplltudos  of  tha  excursion 
of  all  alaotrloal  activity  in  a  person's  brain. 

Mathaaatlcally .  this  situation  would  produce  a  'general'  factor 
q(i],  suoh  that  tha  aaplitud*  of  tbs  kth  ooaponant  In  tha  ith 


Individual  would  b* 
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Mhsrs  Q(l]  Is  ths  1th  Individual's  standardised  aeors  on  ths 
general  conponsnt,  C[k]  is  ths  contribution  of  ths  general 
conponont  to  ths  kth  apsclfle  conponsnt,  and  A'[l,k]  Is  ths 
Individual's  valus  on  ths  specific  part  of  ths  kth  conponsnt. 
Precisely  how  PCA  will  analyse  this  situation  will  depend  upon 
the  values  of  the  coeffeclents  (C[k]J.  The  point  to  note  here 
Is  that  the  elliptical  fora  of  the  data  points  will  be 
distorted  by  the  eslstesoe  of  the  general  conponsnt. 

This  effect  nay  lntsraet  with  distortions  Introduced  by 
the  analysis  of  two  groups  as  If  they  were  a  distinct 
population,  as  discussed  above. 

3.8  The  error  toms,  (  e[l](t)  )•  are  Independently 
drawn  froa  noraal  populations  with  assn  0  end  a  variance,  v(t), 
that  is  oharooterlstlo  of  t. 

This  asaunptlon  Is  osatral  to  PCA.  It  la  alnoat  eertaln 
to  ho  false  If  the  rooords  to  bo  analysed  are  ths  ItPs  taken  on 
Individual  trials.  A  necessary  eoaditlon  for  3. A  to  bo  true  Is 
that  rsndoa  events  that  Influence  a  rooordlng  at  tine  t  bo 
atstlatloally  Independent  of  rsndoa  events  Influencing  the 
reoordlng  at  tine  t«.  Purtheraore,  the  events  Influencing  the 
residuals  on  one  recording,  1,  should  be  statistically 
Independent  of  the  events  Influencing  recording  J. 
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Thla  condition  la  na«ar  not  at  the  level  of  Individual  EBP 
recorda,  for  any  avant  that  cauaed  an  error  In  recording  at 
tine  t  would  alaoat  certainly  have  consequences  that  would 
Influence  rocordlnga  In  the  lnnedlately  subsequent  tine 
periods.  It  la  trivially  easy  to  altlgate  such  effects  by 
averaging  over  trials,  and  using  the  averaged  EBP  as  the  record 
to  ho  analysed.  The  average  proeeaa  la  surprisingly  efficient, 
as  can  he  shown  by  the  Investigation  of  extrene  cases. 

Suppose,  for  instance  that  the  potentials  recorded  at  tinea  t 
and  tel  have  a  'true*  correlation,  (l.o.  correlation  due  to 
coMon  underlying  coapononts)  of  .3,  and  the  correlation 
between  error  effects  la  .9$.  Suppose  further  that  the 
varlaaee  of  the  error  distribution  la  four  tines  the  variance 
of  the  underlying  signal,  l.e.  a  slgnal/nolse  ratio  of  1st. 

The  correlation  between  single  reoorda  has  an  expected  value  of 
.ft.  alaeat  entirely  dne  to  the  error  ceaponeat.  if  the 
reeerda  are  oeapesed  of  an  average  of  100  trials  the 
eerrelatlens  drops  to  .33,  close  to  the  true  value  and  It  drops 
further  to  .31  after  averaging  over  500  trials.  Clearly  the 
averaging  process  oan  drastloally  reduce  the  effects  of 
violations  of  the  atateaent  S.t. 

Unfortunately  averaging  has  no  offsets  on  stateaents  SI, 

S2  and  S3<  Any  violation  of  those  statenents  In  EBP  records 
obtained  on  single  trials  will  also  apply  to  the  records 
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obtainod  by  averaging  over  trial*.  It  ia  therefore  raaaoaabla 
to  ask  what  tba  offsets  of  those  violations  is.  Tho  only 
foasibl*  way  to  answer  this  question  is  to  analyse  aslaulateda 
records,  obtained  by  auaaiag  known  ooaponants,  plus  error,  into 
apsu*do-EBPa  and  than  seeing  if  PCA  oen  eztreot  the  original 
coaponents  free  tho  oonatrustad  reoords. 


Wood  end  HoCarthy  (in  press)  have  raportad  on*  anoh  study, 
that  was  concerned  priaarily  with  violations  of  statoaont*  3. 1 
(orthogonality  of  tba  wav*  'fores )  and  S.3  (ooaponont  aaplitudoa 
defined  by  saapllng  froa  a  single  population.)  Wood  and 
McCarthy  found  that  in  this  situation  PCA  oould  recover  tha 
fores  of  tho  coaponant  waves  quits  wall,  but  that  it  night 
substantially  nlaallooate  tha  percentage  of  tho  varlaaoo 
accounted  for  by  all  except  tho  first  coaponant  extracted  by 
tba  analysis.  In  praotlco,  this  aaant  that  PCA  did  not 
correctly  Identify  tho  coaponant  aaplitudoa  in  Individual 
records  (tha  values  of  (1(1,1]),  k  2)  for  tha  later 
coaponents.  Hood  and  McCarthy  further  found  that  atatlatioal 
analyses  of  Inter-group  dlfferencea  In  tho  aean  values  definsd 
PCA-dlaoovarad  eonponenta  did  not  always  agree  with  a  parallel 
analysis  of  tho  (known)  ooaponants  used  to  genarata  the 
simulated  IBP  records.  Tho  latter  result  would  bo  expected  if 
tha  Individual  coaponant  aaplltuds  values  wars  ala-estlaated. 


My  colleagues  and  I  have  replicated  Hood  and  McCarthy, 
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using  s  slight  variation  of  thslr  techniques  for  gsnsratlng 
HI*'  sad  hava  obtalnsd  ssasntlally  tha  sans  rsaults.  In 
addition,  ws  hava  eonduetad  an  analysis  of  ths  offset  of 
eorralatad  eoaponont  aaplltudes.  • Psuedo-ERPs '  were  gonorstsd 
using  tha  throe  eoaponont  wavs  studied  by  Hood  and  McCarthy, 
but  with  tha  addition  of  a  randoaly  generated  'general  factor", 
(Qtll  In  aquation  (11))  to  eaeh  of  tha  aaplltudes  (A[l,k), 
hs1...3).  Tha  varlanoa  of  tha  Q[l)'s  was  chosen  to  induce  a 
correlation  of  .15  batwaaa  aaplltudes  on  two  of  the  coaponenta, 
calculated  acroaa  records  (1).  Table  1  presents  the  results  of 
this  study.  Zt  shows  the  correlations  between  the  aaplltudes 
of  anah  eoaponent,  as  assigned  In  the  developaent  of  tha  CRT 
raoord,  and  tha  aaplltuda  recovered  by  PCA  followed  by 
varlaaalng.  Thaaa  correlations  are  unacceptably  low.  It  la 
worth  acting  that  thla  result  is  clearly  due  to  the 
Introduction  of  a  correlation  between  eoaponent  aaplltudes.  A 
repetitlea  ef  the  slaulatloa  using  uncorrelated  aaplltudes 
predueed  oorrelatioas  between  the  'true'  and  'recovered' 
aaplltadea  that  were  all  in  esooss  of  .95. 


Table  1  Here 


Thla  result  raises  a  question  about  the  uae  of  PCA  both 
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analyala  disregards  the  faet  that  subjectaxall  Into  dlstlnot 
groups.  If  the  groups  or  Individuals  dlffsr  lnS^helr 
aaplltudes  of  aors  than  ona  component,  then  a  corrsraUon 
between  eoaponent  aaplltudes  will  autoaatleally  be  Introduced 
by  the  exparlasntal  design  (Figure  A).  Thus  the  praotlce  of 
applying  PCA  to  all  the  reeorda  obtainsd  In  either  a 
aultl-group  or  'Individuals'  design  is  highly  suspaot. 

Conclusion 

The  aathaaatlcal  assuaptlons  upon  which  the  PCA  Is  based 
do  not  appear  to  describe  the  way  that  data  is  ganaratsd  In  a 
typical  KIP  study.  Za  spite  of  these  violations,  the 
eoabinatlon  of  PCA  and  varlaax  rotation  appears  to  be  able  to 
oaptura  the  fora  of  the  underlying  wave  foras  fairly  wall, 
■owavar  the  aatlaatloa  of  the  aaplltudes  of  component  waves  in 
individual  reeorda  aay  ha  orroaaous.  Both  the  logical 
arguaaata  prssantad  hare  and  tho  alaulatlon  results  of  Wood  and 
McCarthy  Indicate  that  tho  problaa  la  graataat  for  sstlaatioa 
of  the  latter  ooaponeats  astlaatad  hy  PCA,  aad  la  aocsntuatad 
if  tho  underlying  ooaponaat  loadings  and/or  aaplltudes  are 
correlated. 

For  these  reseoae  it  in  suggested  that  PCA  results  should 
be  treated  with  aoae  akeptlolaa.  Zt  la  worth  noting  that  aueh 
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rsthsr  than  on  Instabilities  of  solution.  Thus  showing  that 
PCA  results  can  be  repeated  Is  aoaewhat  beside  the  point.  The 
biases  will  affect  each  replication. 

This  critique  should  not  obscure  what  Is  perhaps  the  key 
point  of  Donchln  and  Heffeley's  original  proposal.  The 
analysis  of  ERP  records  can  be  regarded  as  a  problea  in 
aultlvarlate  analysis.  The  problea  Is  to  find  an  approach 
within  aultlvarlate  analysis  that  does  not  depend  upon 
aatheaatlcal  assuaptlons  that  have  unreasonable  physiological 
laplloatloae.  The  use  of  PCA  was  a  reasonable  first  step  in 
the  searoh  for  such  a  procedure,  but  It  has  soae  Inadequacies. 
Other  aultlvarlate  analysis  taehnlquea  that  aay  be  aore 
satisfactory.  In  a  subsequent  paper  I  hope  to  report  an 
alternative  aultlvarlate  technique  that  can  be  applied  to  ERP 
records  without  asking  the  strong  astheastlcsl  assuaptlons 
Inherent  In  the  use  of  PCA. 
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Table  1 


Aaplltudea  on  Coapoaenta 
Recovered  r roa  pci 

Aaplltudaa  on  original 


inaata 

I 

II 

III 

I 

.776 

.049 

.069 

II 

.035 

.772 

.036 

III 

.009 

.429 

.764 

Corralatloaa  batwoon  racovarad  and  original  eoapoaanta  whoa  the 
original  ooaponaat  aaplltudaa  I  aad  II  warn  corralated  aeroaa 


rooorda 


Figure  Legends 


Figure  1 
Figure  2 
Figure  3 

Figure  4 


ERPs  plotted  on  two  tine  axes. 

Point  plotted  on  tine  axes  (t-1  and  2)  and  component  axes. 

Two  overlapping  ERPs  (top)  and  their  average  signal  produced 
by  them^ battery 

Illustration  of  the  effect  of  doing  PCA  on  two  distinct  groups, 
treated  as  one  data  set.  The  "best  fitting  ellipse**  for  the 
data  as  a  whole  nay  not  be  illustrative  of  the  data  for  the 
two  groups  individually. 


